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ABSTRACT 

We constrain the basic comological parameters using the first observations by the Very Small 
Array (VSA) in its extended configuration, together with existing cosmic microwave back- 
ground data and other cosmological observations. We estimate cosmological parameters for 
four different models of increasing complexity. In each case, careful consideration is given to 
implied priors and the Bayesian evidence is calculated in order to perform model selection. We 
find that the data are most convincingly explained by a simple flat ACDM cosmology without 
tensor modes. In this case, combining just the VSA and COBE data sets yields the 68 per cent 
confidence intervals Q b h 2 = 0.034+^, Q dm h 2 = 0.18+^, h = 0.72+°;|?j, n s = 1.07+°;^ 

and 08 = 1.17^Q 2o- The most general model considered includes spatial curvature, tensor 
modes, massive neutrinos and a parameterised equation of state for the dark energy. In this 
case, by combining all recent cosmological data, we find, in particular, 95 percent limit on the 
tensor-to-scalar ratio R < 0.63 and on the fraction of massive neutrinos f v < 0.11; we also 
obtain the 68 per cent confidence interval w = — 1 .06l{j 25 on tne equation of state of dark 
energy. 
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1 INTRODUCTION 

In the past two years, a number of experiments have produced accu- 
rate measurements of the power spectrum of anisotropies in the cos- 
mic microwave background (CMB) radiation on a range of angular 
scales (Hanany et al. 2002; Netterfield et al. 2002; Halverson et al. 
2002; Sievers et al. 2002; Benoit et al. 2002). These data, together 
with other cosmological observations, have been used to place in- 
creasingly tight constraints on the values of cosmological parame- 



ters in current models of the formation and evolution of structure in 
the Universe. 

In this letter, we repeat this process with the inclusion of the 
latest observations from the Very Small Array (VSA) in its ex- 
tended configuration. Results from the VSA in its compact config- 
uration have already been presented in Watson et al. (2002), Taylor 
et al. (2002), Scott et al. (2002) and Rubino-Martin et al. (2002) 
(hereafter Papers I - IV). In Grainge et al. (2002, hereafter Paper V) 
these data are combined with the new extended configuration ob- 
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Table 1. The priors assumed for the basic parameters common to all four 
cosmological models under consideration. The notation (a, b) for parameter 
x denotes a top-hat prior in the range a < x < b 



Table 2. The values and priors of assumed for the basic parameters defining 
the four cosmological models described in the text. The notation (a,b) for 
parameter x denotes a top-hat prior in the range a <x <b. 



Basic parameter Prior Model A Model B Model C Model D 



(O b (0.005,0.80) il k (-0.25,0.25) (-0.25,0.25) (-0.25,0.25) 

(O dm (0.01,0.9) / v (0,0.2) 

h (0.4,1.0) w -1 -1 -1 (-1.5,0) 

n s (0.5.1.5) R (0,2) (0,2) 

Zre (4,20) n, (-1,0) (-1,0) 

10 10 A S (0, 100) ^^^^^^^^^^^^=^^^^^^^^^^^^= 



servations to produce a combined power spectrum with 16 spectral 
bins spanning the range i = 160 — 1400. This joint set of observed 
band-powers provides powerful new constraints on cosmological 
parameters. In this letter we extend the traditional likelihood ap- 
proach of previous analyses to a fully Bayesian treatment, including 
careful consideration of our knowledge of cosmological parameters 
prior to the inclusion of any data, and the calculation of Bayesian 
evidences to perform model comparisons. 



2 MODELS, METHODS AND PRIORS 

We restrict our attention to cosmological models in which the 
initial density fluctuations are adiabatic with a simple power-law 
spectrum; such perturbations are naturally produced in the stan- 
dard single-field inflationary model. We assume the contents of 
the Universe to consist of three components: baryonic matter, dark 
matter and vacuum energy, with (present day) densities denoted 
by £2b> f^dm and respectively, measured as a fraction of the 
critical density required to make the Universe spatially-flat (with 
+ Qim + = ^tot)- 



2.1 Model parameterisation and priors 

The parameterisation of the cosmological model can be performed 
in numerous ways, although it is generally preferable to use 
physically-motivated parameters along principal degeneracy direc- 
tions. To this end, in the most general case, we describe the cos- 
mological model using the following 11 parameters: the Hubble 
parameter h (defined as Hq — h x lOOkms -1 Mpc -1 ); the physi- 
cal baryon density (% = Cl^ti 2 ; the physical dark matter density 
COdm = £l(j m /z 2 ; the curvature density £\ = 1 — £2 t ot; the fraction f v 
of dark matter in the form of massive neutrinos; the parameter w 
describing the equation of state of the dark energy (p = wp), the 
redshift of (instantaneous) reionisation z re ; the amplitude of scalar 
modes A s ; the spectral index of scalar modes n s ; the amplitude ra- 
tio R of tensor to scalar modes and the spectral index of the tensor 
modes n t . 

We consider four separate models of increasing complexity, 
in which a successively larger number of the above parameters are 
are allowed to vary. Model A assumes spatial flatness, no massive 
neutrinos and no tensor modes; Model B includes the possibility of 
non-zero curvature; Model C additionally allows for the presence of 
tensor modes; and finally Model D also allows for a contribution to 
the dark matter in the form of massive neutrinos and also a variable 
w. The variable parameters common to all four models are listed 
in Table 1, together with the top-hat priors assumed for each. The 



values and priors assumed for the other parameters are shown in 
Table 2 for each of the four models under consideration. 

The basic parameters described above completely define the 
models considered. It is of interest, however, also to consider the 
following derived parameters: O.^,, £2dm> = &b +^dm> ^A> tne 
age of the universe, the present day rms fluctuation in 8 h~ l Mpc 
spheres as predicted by linear theory rjg, and the optical depth to 
the surface of the last scattering x. In addition to the priors listed 
in Table 1 and 2, we also impose the additional constraints fl A ^ 
and a top-hat prior on the age of the Universe lying between 10 and 
20 Gyr. 

2.2 Bayesian analysis using MCMC samping 

Our approach to Bayesian parameter estimation and model selec- 
tion makes use of Markov-Chain Monte Carlo (MCMC) sampling 
to explore the posterior distribution of the cosmological parame- 
ters (see e.g. Paper IV). For any given model M, samples are drawn 
from the (unnormalised) posterior distribution given by the product 
of the likelihood and prior, i.e. Pr(rf|f?,M)Pr(f?|M), where d de- 
notes the data under analysis and 6 denotes the parameters defining 
the model. The likelihood function is evaluated in the same manner 
as discussed in Paper IV, and the joint prior is simply the product 
of the individual priors discussed in the previous section. 

The particular implementation of the MCMC approach used 
is a slightly adapted version of the Cosmo-mc software package 
(Lewis & Bridle 2002). This sampler uses CAMB (Lewis, Challinor 
& Lasenby 2000) as its underlying power spectrum engine, and is 
specifically tailored to the analysis of CMB data. It achieves high 
acceptance rates and rapid convergence by using a proposal func- 
tion that exploits the difference between 'fast' and 'slow' parame- 
ters in CAMB. 

Given an ensemble of samples from the posterior probability 
distributions, we can easily obtain one-dimensional marginalised 
posterior distributions for any desired parameter, which thus deter- 
mine the constraints implied by the data. Moreover, MCMC tech- 
niques provide a natural way to perform model selection through 
the evaluation of the Bayesian evidence (see e.g. Hobson, Bridle & 
Lahav 2002). For a given model M, the evidence is given by 

E = Pr{d\M) = Jpr(d\0,M)Pr(0\M)d0, 

which is simply the average of the likelihood over the prior. The 
value of the evidence naturally incorporates the spirit of Ockham's 
razor: a simpler theory, having a more compact parameter space, 
will generally have a larger evidence than a more complicated the- 
ory, unless the latter is significantly better at explaining the data. 
Thus, the problem of model selection is answered simply by iden- 
tifying the model with the largest evidence. We have extended the 
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Figure 1. The one-dimensional marginalised probability distributions for 
cosmological parameters in Model A (thin line) and Model B (thick line) 
using priors alone. Thinned samples from the MCMC chains are plotted 
binned into histograms. The y axis shows the number of samples in arbitrary 
units. 
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Figure 2. Marginalised posterior probability distributions of parameters 
from VSA and COBE data alone. The thin line corresponds to Model A 
and the thick line to Model B. Thinned samples from the MCMC chains are 
plotted binned into histograms. The y axis shows the number of samples in 
arbitrary units. The horizontal axes from Fig. 1 are retained. 



Cosmo-mc program to include the calculation of the evidence by 
thermodynamic integration (see e.g. Hobson & McLachlan 2002). 
This is implemented by numerically evaluating the appropriate in- 
tegral during the 'burn-in' of the Markov chain. 'Burn-in' samples 
are discarded for the purpose of the parameter estimation. 

The resulting software was run on a 24-node Linux cluster, 
with each node propagating an independent chain. After burn-in, 
typically 5000 accepted samples were drawn from each chain, 
yielding a total of 120,000 accepted samples. Since successive sam- 
ples from a Markov chain are, by nature, correlated, the accepted 
samples were thinned by a factor of 15 before analysing them fur- 
ther; this resulted in 8,000 independent samples. We have found 
that our final results are very insensitive to the thinning factor. Since 
each chain is run independently, we also obtained 24 separate es- 
timates of the evidence, which are used to obtain an average value 
for E and an estimate of the associated error. 

For calculation of confidence limits we use the 0.165, 0.5 and 
0.835 points of the cumulative probability distribution. Thus, our 
parameter estimate is the median of the marginalised posterior pdf 
and the confidence interval encompasses 67% of the probability. 
Using the median instead of the maximum of the posterior has 
two advantages. Firstly, it gives consistent results under monotonic 
parameter transformation and combination (e.g. the median (£>/, is 
equal to the median Q,/, multiplied by the square of the median h). 
This is especially advantageous in the case of cosmological pa- 
rameter estimation when it is not always clear which parameters 



should be considered basic (for more information see Jaynes 2002, 
p. 621). Secondly, the method is very robust when dealing with 
MCMC chains: the samples may be simply sorted and searched 
with no need for binning and smoothing. Of course, quoting best 
estimates and confidence limits is simply a means of character- 
ising the full one-dimensional marginalised distributions for each 
parameter. Finally, for Models A and B, the parameter estimation 
presented here has been repeated independently using the standard 
grid-based method and the results are consistent. We note that the 
grid based approach would be computationally prohibitive for more 
complex Models C and D. 

2.3 Parameter constraints from priors alone 

The priors on the individual basic parameters are summarised in Ta- 
bles 1 and 2. As noted earlier, we also adopt the constraints (on de- 
rived parameters) Q.^> and the top-hat prior (10,20) Gyr on the 
age of the Universe. It is of interest to determine the effect of these 
combined priors alone on the one-dimensional marginalised poste- 
riors for each parameter. Using the MCMC sampler, this is easily 
performed by simply setting the likelihood to a constant value, i.e. 
the analysis is performed using no data. The resulting marginalised 
distributions for Model A and Model B are shown in Fig. 1 . 

These results are worthy of some discussion. Consider, for ex- 
ample, the distribution for h. This has a clear peak around h 0.7, 
which may be understood as follows. Models with low h are rel- 
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Figure 3. As for Fig. 2 but using data from all considered CMB experi- 
ments. 
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Figure 4. As for Fig. 2 but using all the data sets considered. 



atively disfavoured because they would require £l A < even for 
modest values of co m = C0(j m + COj,, while models with high h are 
disfavoured because they tend to have ages below 10 Gyr. Simi- 
lar arguments may be applied to explain the shapes of the other 
distributions. In particular, we note that, although broad, the dis- 
tributions for Cl m and Cl A peak at 0.3 and 0.7 respectively. The 
non-uniformity of the distributions in Fig. 1 represents the effect 
of the sensible, and seemingly innocuous, initial constraints on the 
chosen parameters. We have also repeated the analysis for Model 
D, and find that the resulting marginalised distributions for the ex- 
tra parameters R, n t , fv and w accurately follow the initial flat prior. 
Having made explicit the impact of our priors, we can now modu- 
late the above distributions by including increasing amounts of data 
from various cosmological observations via the likelihood function. 



3 RESULTS 

We consider three different sets of data in our analysis. Firstly, 
we use only VSA data (from both extended and compact con- 
figurations using the main binning from Paper V), together with 
the COBE power spectrum points (Smoot et al. 1992) to con- 
strain the low-/ normalisation. For this data set, we investigated 
just Models A and B. Secondly, we also include data from several 
recent CMB experiments, namely Maxima (Hanany et al. 2002), 
Boomerang (Netterfield et al. 2002), DASI (Halverson et al. 2002), 
CBI (Sievers et al. 2002) and Archeops (Benoit et al. 2002). For 
these data, we consider Models A, B and C. Finally, we include a 
wide range of additional independent cosmological probes to ob- 
tain the tightest possible constraints on the model parameters. In 



addition to CMB observations, we include constraints from: the 
HST Key Project (Freedman et al. 2001); the first 147,000 red- 
shifts measurements from the 2dF Survey (Colles et al. 2001) on 
scales 0.02 < k/{hM^ 1 ) < 0.15 (Lewis and Bridle 2002; Perci- 
val et al. 2002); nucleosynthesis (Buries, Nollett & Turner 2001); 
and type IA supernovae (Perlmutter et al. 1999). We also include 
the constraint from the gas fraction in relaxed clusters (Allen, 
Schmidt & Fabian 2002; Allen et al. 2002). We do not use the 
local X-ray luminosity function constraints from the cited papers 
due to possible systematic uncertainties. Following Allen et al., we 
marginalise over the cluster bias parameter. Throughout the analy- 
sis we marginalise over applicable calibration and beam uncertainty 
(see Paper IV). 

The resulting one-dimensional marginalised posterior distri- 
butions, for each of the three data sets, are shown in Figs 2-4 for 
a selection of interesting parameters. The corresponding parame- 
ter estimates, confidence limits and evidences are given in Table 
3. Note that the evidence values have been scaled so that, for each 
data set, Model A has an evidence E of unity. 



4 DISCUSSION AND CONCLUSIONS 

By comparing the effective priors on the cosmological parameters 
plotted in Fig. 1 with the posterior distributions inferred from the 
VSA and COBE data alone (Fig. 2), we see that the main strength 
of the VSA lies in its large f-range, which allows one to constrain 
a wide variety of cosmological parameters. In particular, we ob- 
serve that the VSA data significantly improves the constraints on 
C0b, C0(j m , £\, <3s and n s . We note that, as a direct consequence of the 
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Table 3. Constraints on cosmological parameters; consult text for details. If a parameter is constrained, the confidence limits were obtained by calculating 
0.165, 0.5 and 0.835 points of the cumulative probability distribution. When the marginalised posterior for a parameter does not contain a peak, 95% confidence 
limits are given. We also quote evidence values, but note that these can only be compared between models when considering the same data set. 
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pronounced third peak in the VSA power spectrum, the preferred 
value for Cui, is somewhat larger than that from the nucleosynthe- 
sis constraint (Buries et al. 2001) or that from the combined CMB 
data. This excess is statistically significant only at the 1.6a level. 
VSA and COBE data alone do not, however, provide significant 
new constraints on either h or £2 A , especially for Model B. 

On comparing Figs 2 and 3, we see that, as one would expect, 
the constraints on all parameters are tightened by the inclusion of 
all the CMB data. Indeed, for Model A, reasonable constraints are 
obtained on all the parameters under consideration, except for z re , 
although there is some indication that low values of z re are pre- 
ferred. For many of the parameters, the constraints are not signif- 
icantly broader when one allows the spatial curvature to vary in 
Model B. Nevertheless, for this model, some parameters do become 
relatively unconstrained, in particular h, £l m and £2 A , and the limits 
on the age of the Universe are widened considerably. 

Using all recent available cosmological data sets allows one 
to place tight constraints on nearly all parameters, even for Model 
B. Indeed, only z re remains relatively unconstrained, but the earlier 



indication that lower values are preferred appears to be reinforced. 
We note that, aside from the parameters h and the age of the Uni- 
verse, the constraints obtained are very similar for Model A and 
Model B. In fact, from Table 3, we see that, even for Models C and 
D, the constraints on the parameters plotted in Fig. 4 are not signifi- 
cantly broadened. In Fig. 5, we also plot the constraints obtained on 
the additional parameters R, n t , f v and w for Model D. We see that 
the data favour low values of R and high values of n t . More signifi- 
cant constraints can be placed on f v and w. We note from the figure 
that the marginalised distribution for each parameter possesses a 
single peak, although these are not pronounced features, especially 
for / v . Of particular interest is that the preferred value of w cor- 
responds to a cosmological constant and that a firm upper limit is 
obtained on the fraction of dark matter in the form of massive neu- 
trinos. 

To distinguish between the different cosmological models 
considered here, we have calculated the Bayesian evidence for each 
model-data set combination. It must be remembered that evidence 
values can only be compared between different models using the 
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Figure 5. Marginalised posterior probability distributions of additional pa- 
rameters for Model D using all data sets considered. 
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Figure 6. Marginalised posterior probability distributions for h and C0b for 
Model A, using all cosmological data, with (thick line) and without (thin 
line) the relevant direct constraints (dashed line) from the HST Key Project 
and nucleosynthesis respectively. 



same data set. From Table 3, we see that, for each data set, Model 
A (i.e. the simple 6-parameter ACDM cosmology) is preferred, al- 
though the evidence ratio between Model A and Model B is of order 
unity in each case. For Models C and D, however, the evidence ra- 
tio with Model A is considerably larger and shows clearly that such 
general models are not necessary to explain current cosmological 
observations. 

Since Model A is found to have the largest evidence, it is of 
interest to consider more closely the impressively tight parame- 
ter constraints that can be achieved in this simple case using all 
the available cosmological data. In particular, let us focus on h 
and <»(,, which are parameters directly probed by the HST Key 
Project (Freedman et al. 2001) and nucleosynthesis data (Buries 
et al. 2001). In Fig. 6, we plot the posterior distributions of these 
parameters for Model A, with (thick line) and without (thin line) 
the corresponding direct constraint (dashed line). In each case, we 
see that the direct constraint is overwhelmed by the combination of 
the other data sets. 

It is clear from the above analyses that the constraints on the 



parameters in the simplest models become impressively small when 
a wide range of cosmological probes are used. It remains to be 
seen, however, whether this level of precision is matched by a cor- 
responding level of accuracy in the values determined. The model 
comparison exercise demonstrated here does not address the pos- 
sibility that different experiments have different systematic errors. 
Nevertheless, one would hope that these might be reduced when 
many experiments are combined. In future work, we will investi- 
gate other models, with different combinations of free and fixed 
parameters and perform a hyperparameter analysis (Hobson et al. 
2002) to reveal any discrepancies between data sets. 
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